Invited Talk: Processing Broadcast Audio For Information Access
نویسندگان
چکیده
This paper addresses recent progress in speaker-independent, large vocabulary, continuous speech recognition, which has opened up a wide range of near and mid-term applications. One rapidly expanding application area is the processing of broadcast audio for information access. At LIMSI, broadcast news transcription systems have been developed for English, French, German, Mandarin and Portuguese, and systems for other languages are under development. Audio indexation must take into account the specificities of audio data, such as needing to deal with the continuous data stream and an imperfect word transcription. Some near-term applications areas are audio data mining, selective dissemination of information and media monitoring.
منابع مشابه
Processing Broadcast Audio for Information Access
This paper addresses recent progress in speaker-independent, large vocabulary, continuous speech recognition, which has opened up a wide range of near and mid-term applications. One rapidly expanding application area is the processing of broadcast audio for information access. At LIMSI, broadcast news transcription systems have been developed for English, French, German, Mandarin and Portuguese...
متن کاملThe Effect of Broadcast Digitalization on Agricultural Information Dissemination in Nigeria.
Broadcast digitalization with its enormous benefits to the broadcasting industry will improve the quality of content of programs delivered by television stations. Africa has a switchover date of June, 2017. For Nigerians to have access to television broadcast once the switch over is completed, they must purchase high definition television sets or the set-up box. The awareness among urban dwelle...
متن کاملStructuring Broadcast Audio for Information Access
One rapidly expanding application area for state-of-the-art speech recognition technology is the automatic processing of broadcast audiovisual data for information access. Since much of the linguistic information is found in the audio channel, speech recognition is a key enabling technology which, when combined with information retrieval techniques, can be used for searching large audiovisual d...
متن کاملMultimedia interaction for the new millennium
Spoken language processing has created value in multiple application areas such as document transcription, data base entry, and command and control. Recently scientists have been focusing on a new class of application that promises on-demand access to multimedia information such as radio and broadcast news. In separate research, augmenting traditional graphical interfaces with additional modali...
متن کاملSpectral cross-correlation features for audio indexing of broadcast news and meetings
This paper describes the effect of three new acoustic feature parameters to detect audio source segments that are based on spectral cross-correlation: spectral stability, white noise similarity, and sound spectral shape. These parameters are devised for accurate audio source detection and are used in a pre-processing module for automatic indexing of the broadcast news and the meetings. We condu...
متن کامل